AITopics | enable robust deep multimodal analysis

Collaborating Authors

enable robust deep multimodal analysis

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Implicit Differentiable Outlier Detection Enable Robust Deep Multimodal Analysis

Neural Information Processing SystemsDec-24-2025, 09:36:50 GMT

Deep network models are often purely inductive during both training and inference on unseen data. When these models are used for prediction, but they may fail to capture important semantic information and implicit dependencies within datasets. Recent advancements have shown that combining multiple modalities in large-scale vision and language settings can improve understanding and generalization performance. However, as the model size increases, fine-tuning and deployment become computationally expensive, even for a small number of downstream tasks. Moreover, it is still unclear how domain or prior modal knowledge can be specified in a backpropagation friendly manner, especially in large-scale and noisy settings.

electronic proceedings, enable robust deep multimodal analysis, name change, (5 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.56)
Information Technology > Artificial Intelligence > Natural Language (0.39)

Add feedback

Implicit Differentiable Outlier Detection Enable Robust Deep Multimodal Analysis

Neural Information Processing SystemsOct-10-2024, 19:56:49 GMT

enable robust deep multimodal analysis, explicit knowledge, vision and language, (2 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.59)
Information Technology > Data Science > Data Mining > Anomaly Detection (0.44)

Add feedback

Differentiable Outlier Detection Enable Robust Deep Multimodal Analysis

Wang, Zhu, Medya, Sourav, Ravi, Sathya N.

arXiv.org Artificial IntelligenceFeb-11-2023

Often, deep network models are purely inductive during training and while performing inference on unseen data. Thus, when such models are used for predictions, it is well known that they often fail to capture the semantic information and implicit dependencies that exist among objects (or concepts) on a population level. Moreover, it is still unclear how domain or prior modal knowledge can be specified in a backpropagation friendly manner, especially in large-scale and noisy settings. In this work, we propose an end-to-end vision and language model incorporating explicit knowledge graphs. We also introduce an interactive out-of-distribution (OOD) layer using implicit network operator. The layer is used to filter noise that is brought by external knowledge base. In practice, we apply our model on several vision and language downstream tasks including visual question answering, visual reasoning, and image-text retrieval on different datasets. Our experiments show that it is possible to design models that perform similarly to state-of-art results but with significantly fewer samples and training time.

artificial intelligence, enable robust deep multimodal analysis, machine learning, (2 more...)

arXiv.org Artificial Intelligence

2302.05608

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.87)
Information Technology > Artificial Intelligence > Natural Language (0.87)
Information Technology > Communications > Networks (0.53)
(2 more...)

Add feedback